A Relational Approach to Novelty Detection in Data Streams
نویسندگان
چکیده
A data stream is a sequence of time-stamped data elements which arrive on-line, at consecutive time points. In this work we propose a multi-relational approach to mine complex data streams in order to identify novelty patterns which target new or unknown situations in the stream. Multi-relational data mining is motivated by the existence of several real-world data stream applications where data elements are complex data scattered in several database relations of a relational database. In our proposal a stream is mined according to a data block model, that is, the stream is segmented in a sequence of data blocks where each data block consists of complex data elements arriving in a user define period (e.g., daily or monthly). A relational pattern base is mined each time a new data block arrives in the stream and a time window is used to filter out novelty patterns. An application of the proposed algorithm to the problem of detecting anomalies in network traffic is described
منابع مشابه
Mining Frequent Patterns in Uncertain and Relational Data Streams using the Landmark Windows
Todays, in many modern applications, we search for frequent and repeating patterns in the analyzed data sets. In this search, we look for patterns that frequently appear in data set and mark them as frequent patterns to enable users to make decisions based on these discoveries. Most algorithms presented in the context of data stream mining and frequent pattern detection, work either on uncertai...
متن کاملRelational Frequent Patterns Mining for Novelty Detection from Data Streams
We face the problem of novelty detection from stream data, that is, the identification of new or unknown situations in an ordered sequence of objects which arrive on-line, at consecutive time points. We extend previous solutions by considering the case of objects modeled by multiple database relations. Frequent relational patterns are efficiently extracted at each time point, and a time window ...
متن کاملNovelty Detection from Evolving Complex Data Streams with Time Windows
Novelty detection in data stream mining denotes the identification of new or unknown situations in a stream of data elements flowing continuously in at rapid rate. This work is a first attempt of investigating the anomaly detection task in the (multi-)relational data mining. By defining a data block as the collection of complex data which periodically flow in the stream, a relational pattern ba...
متن کاملCognitively Motivated Novelty Detection in Video Data Streams
Automatically detecting novel events in video data streams is an extremely challenging task. In recent years, machine-based parametric learning systems have been quite successful in exhaustively capturing novelty in video if the novelty filters are well-defined in constrained environments. Some important questions however remain: How close are such systems to human perception? Can results deriv...
متن کاملEfficient Online Novelty Detection in News Streams
Novelty detection in text streams is a challenging task that emerges in quite a few different scenarii, ranging from email threads to RSS news feeds on a cell phone. An efficient novelty detection algorithm can save the user a great deal of time when accessing interesting information. Most of the recent research for the detection of novel documents in text streams uses either geometric distance...
متن کامل